CDS

Accession Number TCMCG042C14820
gbkey CDS
Protein Id XP_016446956.1
Location join(7309..7593,8722..8769,9369..9545,9633..9783,9887..9973,10080..10204,10582..10700,11199..>11310)
Gene LOC107771994
GeneID 107771994
Organism Nicotiana tabacum
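
The Location field can be checked against the protein length reported in this record by summing its segment lengths. The following is a minimal Python sketch. It assumes the location uses only the join(start..end,...) form shown above with 1-based inclusive coordinates. The helper name cds_length is hypothetical and not part of any database tooling.

```python
import re


def cds_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a join(...) location."""
    total = 0
    # A partial-end marker ('<' or '>') may precede a coordinate; it is skipped.
    for start, end in re.findall(r"[<>]?(\d+)\.\.[<>]?(\d+)", location):
        total += int(end) - int(start) + 1  # coordinates are 1-based, inclusive
    return total


# Location string copied from this record.
LOCATION = ("join(7309..7593,8722..8769,9369..9545,9633..9783,"
            "9887..9973,10080..10204,10582..10700,11199..>11310)")

length_nt = cds_length(LOCATION)
# Prints total nucleotides, whole codons, and leftover bases.
print(length_nt, length_nt // 3, length_nt % 3)  # 1104 368 0
```

The eight segments total 1104 nt, which is 368 codons with no remainder and matches the Length field in the Protein section. The ">" on the last coordinate marks the 3' end as incomplete, in line with "partial" in the Definition.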

Protein

Length 368 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016591470.1
Definition PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like, partial [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category M
Description procollagen-lysine 5-dioxygenase activity
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07376
KEGG_rclass RC00017, RC02950
BRITE ko00000, ko00001, ko01000
KEGG_ko ko:K08730
EC 2.7.8.29
KEGG_Pathway ko00564, ko01100, ko01110, map00564, map01100, map01110
GOs -

Sequence

CDS:  
ATGGATCAGAGAAGGGAGGCACAAGCTAACGAGATTAATGGGAGTCGGAGCAATGAAAACGACAGCGTTTCGTCGAAGGGGCCAGCGCTGCGGCTGTACCCATGTGTAGAGAAGAAGGCGGAGAAGTATGAGGATTTAGAAGAAGAATTGGAGTTCAGCCCACATCTATACAGTGCTCTTGAGCGGCATCTTCCGACCAGCGTCCTCAGTTCATCTCGAGACAACAAGGTCCAATACATGACTGATATTCTCCTCCGTTACTCTCCCCGCTCCGATCGCAGTCGCTTGCAGAAACATGGAGAATACAGGCAGAAAATCATATCAAACTATCAGCCTCTACATAGGGTGTTATATACCATGCACGCTGCAGATTTCTTTGTGCCTTCATTTATTAAGGCGATCAGTGAGAATACGGAGGAAAGCTTCAGAAAAATAATGTCTGAAGCTTCTCCAGGTGTTTTTACATTTGAAATGCTTCAACCACGTTTTTGTGAGATGATGTTGGCTGAGGTACAAAACTTTGAGAAATGGGTTCGTGAAACAAAATTCAGAATCATGCGCCCCAATACTATGAACAAATTTGGATCCGTTCTTGATGATTTTGGCCTTCAAAACATGCTTCAGAAGTTTATGGAAGATTTTATACGCCCTATTTCAAGAGTTTTTTTTACTGAAGTTGGTGGATCCACACTCGATGGTCATCATGGTTTTGTCGTTGAGTATGGGACAGACAGAGACATTGACTTGGGTTTCCATGTTGATGATGCGGAGGTCACTTTGAATGTGTGCTTAGGAAAGCAATTCACAGGTGGAGAGTTGTTCTTTCGAGGTGTGCGGTGCGAGAAGCATGTGAATTCTGATACACAACCAGAGGAGATCTTTGATTATGCGCATATCGCGGGGCGTGCAATTCTTCATTGTGGTCGCCATAGGCATGGTGCTAGAGCGACAACATCTGGGCAGAGGATCAACTTGTTGATATGGTGCAGAAGCTCCGTTTTCAGAGAAATGAGGAAGTACCAAACAGATTTTCCTAGCTGGTGTGCAGAGTGCAAACGTGAGAAGGAAGAAAGGATACGGCAGAAAGTTTCTACTCTCAAATCG
Protein:  
MDQRREAQANEINGSRSNENDSVSSKGPALRLYPCVEKKAEKYEDLEEELEFSPHLYSALERHLPTSVLSSSRDNKVQYMTDILLRYSPRSDRSRLQKHGEYRQKIISNYQPLHRVLYTMHAADFFVPSFIKAISENTEESFRKIMSEASPGVFTFEMLQPRFCEMMLAEVQNFEKWVRETKFRIMRPNTMNKFGSVLDDFGLQNMLQKFMEDFIRPISRVFFTEVGGSTLDGHHGFVVEYGTDRDIDLGFHVDDAEVTLNVCLGKQFTGGELFFRGVRCEKHVNSDTQPEEIFDYAHIAGRAILHCGRHRHGARATTSGQRINLLIWCRSSVFREMRKYQTDFPSWCAECKREKEERIRQKVSTLKS
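
The relationship between the CDS and Protein sequences above can be illustrated by translating codons in frame. The following is a minimal Python sketch. It assumes the standard genetic code, which the record does not state explicitly, and the translate helper is hypothetical. Only the first seven codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (an assumption), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, usable, 3))


# First 21 nt of the CDS sequence in this record.
print(translate("ATGGATCAGAGAAGGGAGGCA"))  # MDQRREA
```

The output matches the first seven residues of the Protein sequence. Passing the full CDS string to the same function would be the natural way to compare the whole translation against the listed protein.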